Scalable critical-path analysis and optimization guidance for hybrid MPI-CUDA applications



Similar resources

MPI- and CUDA- implementations of modal finite difference method for P-SV wave propagation modeling

Among different discretization approaches, the Finite Difference Method (FDM) is widely used for acoustic and elastic full-waveform modeling. An inevitable drawback of the technique, however, is its severe demand for computational resources. A promising solution is parallelization, where the problem is broken into several segments, and the calculations are distributed over different processors. ...


Near–Critical Path Analysis: A Tool for Parallel Program Optimization

Program activity graphs (PAGs) can be constructed from timestamped traces of appropriate execution events. Information about the activities on the k longest execution paths is useful in the analysis of parallel program performance. In this paper, four algorithms for finding the near–critical paths of PAGs are compared, including a best–first search (BFS) algorithm that is worst–case asymptotica...


An MPI-CUDA Implementation and Optimization for Parallel Sparse Equations and Least Squares (LSQR)

LSQR (Sparse Equations and Least Squares) is a widely used Krylov subspace method for solving large-scale linear systems in seismic tomography. This paper presents a parallel MPI-CUDA implementation of the LSQR solver. At the CUDA level, our contributions include: (1) utilizing CUBLAS and CUSPARSE to compute major steps in LSQR; (2) optimizing memory copies between host memory and device memory; (3) develop a ...


Runtime affinity optimization for hybrid MPI + OpenMP solvers

In the physical sciences, it is not uncommon to encounter parallel numerical codes in which computation is partitioned between solution of multiple fundamentally different problems, related in a pipelined manner through their state variables. Such a multi-phase scheme might prove advantageous by reducing the “spin up” and “wind down” time that would otherwise be incurred when using separate cod...


Workflow Timed Critical Path Optimization

Approaches to shortening workflow execution time have been discussed in many areas of computer engineering, such as parallel and distributed systems, computer circuits, and PERT charts for project management. To optimize the structure of a workflow model, an approach with corresponding algorithms is proposed to cut the timed critical path of a workflow schema, which has the longest average execution ti...



Journal

Journal title: The International Journal of High Performance Computing Applications

Year: 2016

ISSN: 1094-3420, 1741-2846

DOI: 10.1177/1094342016661865